SPA: Web-based Platform for easy Access to Speech Processing Modules

نویسندگان

  • Fernando Batista
  • Pedro Curto
  • Isabel Trancoso
  • Alberto Abad
  • Jaime Ferreira
  • Eugénio Ribeiro
  • Helena Moniz
  • David Martins de Matos
  • Ricardo Ribeiro
چکیده

This paper presents SPA, a web-based Speech Analytics platform that integrates several speech processing modules and that makes it possible to use them through the web. It was developed with the aim of facilitating the usage of the modules, without the need to know about software dependencies and specific configurations. Apart from being accessed by a web-browser, the platform also provides a REST API for easy integration with other applications. The platform is flexible, scalable, provides authentication for access restrictions, and was developed taking into consideration the time and effort of providing new services. The platform is still being improved, but it already integrates a considerable number of audio and text processing modules, including: Automatic transcription, speech disfluency classification, emotion detection, dialog act recognition, age and gender classification, non-nativeness detection, hyperarticulation detection, dialog act recognition, and two external modules for feature extraction and DTMF detection. This paper describes the SPA architecture, presents the already integrated modules, and provides a detailed description for the ones most recently integrated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ECESS Platform for Web Based TTS Modules and Systems Evaluation

The paper presents platform for web based TTS modules and systems evaluation named RES (Remote Evaluation System). It is being developed within the European Centre of Excellence for Speech Synthesis (ECESS, www.ecess.eu). The presented platform will be used for web based online evaluation of various text-to-speech (TTS) modules, and even complete TTS systems, presently running at different Inst...

متن کامل

Making Czech Historical Radio Archive Accessible and Searchable for Wide Public

In this paper we describe a complex software platform that is being developed for the automatic transcription and indexation of the Czech Radio archive of spoken documents. The archive contains more than 100.000 hours of audio recordings covering almost ninety years of public broadcasting in the Czech Republic and former Czechoslovakia. The platform is based on modern speech processing technolo...

متن کامل

A voice user interface demonstration system for mexican Spanish

We present a Mexican Spanish voice user interface demonstration system. It was built on a speech research platform developed at Bell Labs, which provides major speech technology and interface components, including automatic speech recognition, text-to-speech synthesis, audio input/output functions and telephone interface. The application is written in the PERL script language with an embedded V...

متن کامل

WWWTranscribe - a modular transcription system based on the world wide web

WWWTranscribe is a transcription system based on the WWW. It is platform independent and allows network access to speech databases. Its modular structure make it flexible, and it connects easily to existing signal processing applications or database management systems. WWWTranscribe consists of static HTML documents containing forms. To these forms CGI applications are attached that perform dat...

متن کامل

Towards the next generation of speech tools and corpora

This special edition picks up the theme 16 years after Bird and Harrington (2001) of current developments in software tools for processing speech and language data. The main objective now is much as it was then: to design and make freely available tools that are independent of the research task and computing environment for creating, annotating, querying, and analysing data from extensive speec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016